Why and How Hippocampal Transition Cells Can Be Used in Reinforcement Learning

Authors

Julien Hirel

Philippe Gaussier

Mathias Quoy

Jean-Paul Banquet

Abstract

In this paper we present a model of reinforcement learning (RL) which can be used to solve goal-oriented navigation tasks. Our model supposes that transitions between places are learned in the hippocampus (CA pyramidal cells) and associated with information coming from path-integration. The RL neural network acts as a bias on these transitions to perform action selection. RL originates in the basal ganglia and matches observations of reward-based activity in dopaminergic neurons. Experiments were conducted in a simulated environment. We show that our model using transitions and inspired by Q-learning performs more efficiently than traditional actor-critic models of the basal ganglia based on temporal difference (TD) learning and using static states.
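The idea of attaching learned values to transitions between places, rather than to static states, can be illustrated with a minimal tabular sketch. This is not the authors' neural network model: it is a plain Q-learning stand-in in which each value is indexed by a transition (a, b) and biases which outgoing transition is selected. The five-place linear track, the reward of 1 at the goal, the helper names `train_transition_q` and `greedy_path`, and all parameter values are assumptions made for illustration only.

```python
import random


def train_transition_q(neighbors, goal, episodes=200, alpha=0.5, gamma=0.9,
                       epsilon=0.1, max_steps=100, seed=0):
    """Tabular Q-learning with one value per transition (a, b) between places.

    Hypothetical stand-in for a transition-based RL model; not the paper's network.
    """
    rng = random.Random(seed)
    # One value per directed transition between neighboring places.
    q = {(a, b): 0.0 for a, outs in neighbors.items() for b in outs}
    starts = [p for p in neighbors if p != goal]
    for _ in range(episodes):
        place = rng.choice(starts)
        for _ in range(max_steps):
            options = neighbors[place]
            if rng.random() < epsilon:
                # Occasional random exploration.
                nxt = rng.choice(options)
            else:
                # The learned values bias which transition is selected;
                # ties are broken at random.
                best = max(q[(place, b)] for b in options)
                nxt = rng.choice([b for b in options if q[(place, b)] == best])
            reward = 1.0 if nxt == goal else 0.0
            # Bootstrap from the best transition leaving the next place.
            future = 0.0 if nxt == goal else max(q[(nxt, c)] for c in neighbors[nxt])
            q[(place, nxt)] += alpha * (reward + gamma * future - q[(place, nxt)])
            place = nxt
            if place == goal:
                break
    return q


def greedy_path(q, neighbors, start, goal, max_steps=50):
    """Follow the highest-valued outgoing transition from each place."""
    path = [start]
    while path[-1] != goal and len(path) <= max_steps:
        here = path[-1]
        path.append(max(neighbors[here], key=lambda b: q[(here, b)]))
    return path


# Assumed toy environment: a linear track of places 0-1-2-3-4 with the goal at 4.
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}
q = train_transition_q(neighbors, goal=4)
print(greedy_path(q, neighbors, start=0, goal=4))
```

In this sketch the transition leading toward the goal ends up with a higher value than the one leading away from it, so following the best-valued transition from each place traces a route to the goal.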

Similar resources

Design Principles of the Hippocampal Cognitive Map

Hippocampal place fields have been shown to reflect behaviorally relevant aspects of space. For instance, place fields tend to be skewed along commonly traveled directions, they cluster around rewarded locations, and they are constrained by the geometric structure of the environment. We hypothesize a set of design principles for the hippocampal cognitive map that explain how place fields repres...

WHY AND HOW TO APPLY QUANTUM LEARNING AS A NEW APPROACH TO IMPLEMENTATION THE CURRICULUM

The present study was philosophical and analytical research that examines quantum learning as an effective approach to the curriculum in a qualitative way. It explored books, published essays, and related studies, and took some advantages of online materials on the issue from domestic and foreign sources. Because of large body of data on the issue, only the relevant information was included. Da...

Reinforcement learning based feedback control of tumor growth by limiting maximum chemo-drug dose using fuzzy logic

In this paper, a model-free reinforcement learning-based controller is designed to extract a treatment protocol because the design of a model-based controller is complex due to the highly nonlinear dynamics of cancer. The Q-learning algorithm is used to develop an optimal controller for cancer chemotherapy drug dosing. In the Q-learning algorithm, each entry of the Q-table is updated using data...

A neural network model of adaptively timed reinforcement learning and hippocampal dynamics.

A neural model is described of how adaptively timed reinforcement learning occurs. The adaptive timing circuit is suggested to exist in the hippocampus, and to involve convergence of dentate granule cells on CA3 pyramidal cells, and N-methyl-D-aspartate (NMDA) receptors. This circuit forms part of a model neural system for the coordinated control of recognition learning, reinforcement learning,...

Towards Behavior-Aware Model Learning from Human-Generated Trajectories

Inverse reinforcement learning algorithms recover an unknown reward function for a Markov decision process, based on observations of user behaviors that optimize this reward function. Here we consider the complementary problem of learning the unknown transition dynamics of an MDP based on such observations. We describe the behavior-aware modeling (BAM) algorithm, which learns models of transiti...

Publication date: 2010
